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The present paper deals with the problem of allocating patients to 
two competing treatments in the presence of covariates or prognostic 
factors in order to achieve a good trade-off among ethical concerns, 
inferential precision and randomness in the treatment allocations. In 
particular we suggest a multipurpose design methodology that com- 
bines efficiency and ethical gain when the linear homoscedastic model 
with both treatment /covariate interactions and interactions among 
covariates is adopted. The ensuing compound optimal allocations of 
the treatments depend on the covariates and their distribution on 
the population of interest, as well as on the unknown parameters 
of the model. Therefore, we introduce the reinforced doubly adap- 
tive biased coin design, namely a general class of covariate-adjusted 
response-adaptive procedures that includes both continuous and dis- 
continuous randomization functions, aimed to target any desired allo- 
cation proportion. The properties of this proposal are described both 
theoretically and through simulations. 

1. Introduction. In the medical profession physicians are expected to act 
in the best interests of each patient under their care, but in clinical trials 
the patients are statistical units in an experiment, and the demands of in- 
dividual care and experimental information often come into conflict. Thus 
the ensuing ethical problem is how to balance the welfare of the patients 
in the trial against a possible knowledge gain that will improve the care 
of future patients. In experimental medicine it is commonly believed that 
randomized trials are the answer, especially in the case of Phase III clinical 
trials, where the aim is to compare the efficacy of two available treatments 
and patients are sequentially randomized to one of them. In this context, 
several design methodologies have been recently proposed in order to derive 
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suitable target allocations of the treatments that represent a compromise 
between ethical demands and inferential goals [4, 17, 22]. These targets de- 
pend in general on the unknown model parameters and can be implemented 
by adopting suitable response-adaptive procedures, such as the sequential 
maximum likelihood design [15], the doubly adaptive biased coin design [13] 
and the efficient randomized-adaptive design (ERADE) [14], converging to 
them. 

An additional peculiarity of Phase III trials is that they usually involve 
some set of important prognostic factors or covariates. The role of these 
factors may be crucial in order to derive correct inferential conclusions about 
the treatment effects and this is one of the reasons for which taking into 
account the covariates has now become of primary importance not only 
from an inferential point of view but also from a design perspective (for 
a recent literature [2, 5, 18, 20]). 

In the last decade there has been a growing statistical interest in the topic 
of adaptive designs adjusted for covariates and, in particular, in covariate- 
adjusted response-adaptive (CARA) randomization methods; see [12, 19, 
23]. This is a class of sequential allocation procedures that modifies the prob- 
abilities of treatment assignments on the basis of the available information — 
that is, earlier responses and allocations, past covariate profiles and the 
covariate information of the present patient — with the aim of skewing the 
allocations towards the treatment that appears to be superior or, more gen- 
erally, of converging to a desired target that should incorporate inferential 
demands related to optimal inference about the treatment effects and, even- 
tually, ethical concerns. 

Even if in the presence of prognostic factors the inferential methods, as 
well as the ethical goals, change on the basis of the nature of the responses 
and the covariates, the design literature has essentially focused on the simpli- 
fied scenario of absence of treatment/covariate interactions; see, for instance, 
[3, 7, 8]. In this context, the relative performance of the treatments is the 
same for every subject's profile, so that the ethical demand simply consists 
of allocating the best treatment to as many patients as possible, indepen- 
dently on their covariates. This also explains the absence of methodological 
proposals through which one can derive target allocations that, by incorpo- 
rating both inferential and ethical considerations, depend on the covariates. 
Whereas, in the more complex scenario of treatment/covariate interactions, 
the covariates play a fundamental role also from an ethical viewpoint, since 
the superiority /inferiority of a given treatment, as well as the discrepancy 
between the treatment effects, depend on the subject's profiles. 

Therefore, one of the primary aims of the present paper consists of analyz- 
ing, from a design perspective, the linear model with both treatment/covari- 
ate interactions and interactions among covariates, in terms of ethical impact 
as well as inferential efficiency. After deriving the analytical expressions of 
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the most popular information criteria, we propose a multipurpose design 
strategy based on a compound optimization approach, that combines infer- 
ential precision and ethical gain by means of flexible weights, which can be 
fixed a priori by the experimenter, or they may be functions of the unknown 
model parameters. 

This multipurpose criterion leads to a locally optimal allocation which 
depends, in general, on the covariates and their population distribution, as 
well as on the unknown model parameters, and allows to promote for ev- 
ery profile a suitable compromise between information and ethical demands. 
Furthermore, we introduce the reinforced doubly adaptive biased coin de- 
sign (RDBCD), namely a new class of CARA procedures that generalizes 
some earlier works [1, 23] and also extends several procedures proposed in 
the literature, such as the doubly adaptive biased coin design and the ER- 
ADE, to the covariate setting. The RDBCD, which admits both continuous 
and discontinuous randomization functions, can target any desired alloca- 
tion proportion, allowing also to force closeness to the chosen target in an 
appropriate way, while maintaining randomization. We show, both theoret- 
ically and through simulations, that the proposed procedure has desirable 
properties, asymptotically and, in particular, for small samples. 

The paper is structured as follows. Starting from the notation in Section 2, 
Sections 3 and 4 deal with the optimal designs for inference and ethics, re- 
spectively. Section 5 describes the combined approach, while the properties 
of the ensuing optimal compound target are discussed in Section 6. In Sec- 
tion 7 we introduce the RDBCD, showing its asymptotic properties as well 
as the asymptotic inference related to the adoption of such a procedure. 
Section 8 deals with some finite sample comparisons between our proposal 
and some of the fundamental procedures proposed in the literature. We end 
the paper with a brief discussion in Section 9. 

Motivated by the clinical practice, the present paper takes into account 
categorical covariates, since in the large majority of real Phase III clinical 
trials the prognostic factors are polytomous and, even if quantitative, they 
are often categorized by adopting suitable thresholds. Furthermore, for ease 
of notation we deal with just two covariates, but the extension to the case 
of several factors is straightforward (see [6]). 

2. The linear model with covariates. Let A and B be two competing 
treatments. We suppose that for each subject entering the trial we observe 
a vector Z of concomitant categorical variables. Moreover, we assume the 
covariates to be random, that is, they are not under the experimenters' 
control when the subjects turn up for the trial, but they can be measured 
before assigning a treatment. Then the treatments are assigned according 
to a given randomization rule, with 5 an indicator variable such that 6 = 1 
or if the subject is assigned to A or B, respectively, and an outcome Y 
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is observed. Conditionally on the covariates and the treatments, patients' 
responses are assumed to be independent. A common model for the response 
that accounts for treatment/covariate interactions is the following linear 
homoscedastic model: 



(2.1) 



E{Yi) = 6,flA + (1 - Si)fiB + f (Zi)*(5i/3A + (1 - S^)PB), 
,2 



V{Y,) = a', i = l,...,n, 

where Yi is the outcome of the ith subject, ij-a and hb are the baseline 
treatment effects, f(-) is a known vector function, Zj is the vector of covari- 
ates observed on the ith individual and PatPb are p-dim vectors of possibly 
different regression parameters related to A and B, respectively. Under this 
model HA,fJ'B,PA and Pb are of interest since the relative performance of 
the treatments depends on the patient's covariates. Indeed, for any given 
covariate profile Zj, we obtain 



(2.2) 



(zi) = E{Yi I Si = l,Zi = z,) - E{Y, I 5, = 0, Z, 
= fiA-I^B + f (zO*(/3a -l3B) = a + f (zi) 



Zj) 



)r. 

After n assignments, let F = (f(z)*)„xp, = (h, ■ ■ ■,SnY, A„ = diag(5.„) 
and X = [dn :ln — Sn- A„F : (I„ — A„)F] , where 1„ and I„ are the n-dim vec- 
tor of ones and the n-dim identity matrix, respectively. Moreover, let 7„ = 
{fiA, fJ-B, be the least square estimator of 7 = (/_i^,;UB,/3^,/3^)*, 

then if (X*X)~^ exists, the variance-covariance matrix is 

mn) = ^'(X*X)-i = n-VM-i, 

where M is the (2 + 2p)-dim average {per observation) information matrix 





1 " 

i=l 





5*F 


\ 

Olxp 


(2.3) M = - 

n 





n 

i=l 
Opxl 

F*(l„-<5„) 


Olxp 


(1„ - SnYF 




F*d„ 

\ Opxl 


F*A„F 

Opxp 


Opxp 

F*(I„ - A„)F J 



Remark 2.1. In the absence of treatment / covariate interactions = 
(3b = P, that is, t = 0, and the homoscedastic model (2.1) becomes 

(2.4) EiYi) = 5i^lA + il-S^)^lB + ^iziy(3, i = l,...,n. 

In this case, it is customary to regard /3 as a nuisance parameter, since 
from (2.2) 6{zi) = ^a — IJ-B for any given covariate profile Zj, so that the 
inferential interest typically lies in estimating 11 a and 11 b-, or ha — IJ'B, as 
precisely as possible. 
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3. Inferential optimality and balanced designs. In order to avoid cum- 
bersome notation, from now on we assume, witliout loss of generality, only 
two categorical covariates, that is, Z = (T, W). Suppose that T is categorized 
into levels to,ti, . . . ,tj and let wo,wi, . . . ,wl be the levels of W, so that T 
and W can be represented by a J-dimensional vector T and a L-dimensional 
vector W of dummy variables, respectively, where to and wq are the refer- 
ence categories. Assume that {Zi,i > 1} is a sequence of i.i.d. random vec- 
tors, where each Zi is distributed in the population according to Y'i{Zi = 
{t,,wi)}=p{j,l)>Q (i = 0,...,J;Z = 0...,L), where E/=o Ef=o^'(i' = 1- 
Moreover, in order to account for the general situation of both treatment/ 
covariate interactions and interaction among covariates in the rest of the 
paper, we let 

(3.1) f*(z) = (T*,W*,T*®W*), 

that is, f(-) is the p-dim vector including all interaction effects, with p = 
J + L + J-L. 

At the end of a trial with n assignments, let Nn{j,l) = X]"=i l{Zi=(tj ,«;;)} 
be the number of subjects within the stratum {tj,wi), where repre- 
sents the indicator function, Nn{j,l) = X]"=i '5il{z,=(tj the number of 
allocations to A within this stratum and iTnijjl) the corresponding pro- 
portion, that is, TTnUJ) = Nn{j,l)~^Nnij,l), for any j = 0,...,J and 1 = 
0, . . . , L. Moreover, let N„ = {Nn{j, I) : j = 0, . . . , J;l = 0, . . . , L} and 7r„ = 

{7Tn{j,l).j = 0,...,J;l = 0,...,L}. 

Adopting model (2.1), the design for optimal inference consists in allo- 
cating the treatments so as to minimize one of the following criteria: 

CI det1/(7„) =det(n-V2M-i); 

C2 det1/(|J^) =det(n-V2D*M-iD), where D* = (02px2 : l2p); 

C3 tvVijJ =tr{n~^a^M~^); 

C4 trV{^^) =tr(n-iCT2D*M-iD); 

C5 tr y(/3l - 0b) = tr(n- V2E*M-1E), E* = (0^x2 : Ip : -Ip) 

with replaced by the Moore-Penrose inverse, if needed. It is easy to 

check that C1~C5 are convex functions of M, invariant with respect to 
permutations of the bottom two block rows and the two right-hand block 
columns of M. 

For given covariates, the jointly balanced design 

(3.2) 7r*j = {TT*j{j,l) = l/2, for any j = 0,..., J and / = 0,...,L}, 

is optimal for model (2.1) with respect to any criterion "3>/ of the informa- 
tion matrix M, which is convex and invariant w.r.t. permutations of the 
bottom two block rows and the two right-hand block columns, as well as the 
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first two rows and columns. To see this, it is straightforward to check that 
assuming (3.2) the ensuing information matrix 



(3.3) 



M* = — 
2n 



n 






n 

Opxl 

F*l„ 



1*F 

Olxp 

Ftp 
0. 



Oixp\ 
1* F 




Jpxp 



■'pxp 
F*F J 



F*l„ 

VOpxi 

is invariant w.r.t. permutations of the bottom two block rows and the two 
right-hand block columns, as well as the first two rows and columns. For 
any information matrix M of the type (2.3), by the simultaneous permu- 
tation of the first two rows and two columns as well as the bottom two 
block rows and the two right-hand block columns, we get the informa- 
tion matrix M corresponding to the design which switches treatments A 
and B. Clearly $/(M) = ^>/(M), (M + M)/2 = M* and then by convex- 
ity 

<l>i{M*) = «>/(|(M + M)) < i[$/(M) + $/(M)] = $/(M). 

Note that, independently on the presence or absence of treatment/covariate 
interactions, the jointly balanced allocation (3.2) is still optimal, even in 
the absence of interactions among covariates; see [5] for a detailed discus- 
sion. 



Proposition 3.1. Assuming model (2.1) with (3.1), inferential crite- 
ria C1-C5 can he simplified as follows: 



(3.4) CI; detf— M-i^=^ . 



A+ip 



(3.5) C2; det — D^M-^D 



n 

2 

C3.- trf — 

n 



ij,l)[l-7Tn{j,l)]Nn{j,iy' 



n ,=0 nf=0 ^n(i, I) [1 - vr„(j, 0]iVn(j, 1? 
I 



■ J L 
EE 



^^iVn,(j,07rn(j,0[l-7r„(j,0] 
L + 1 



(3.6) 



^iVn(j,0)7r„(j,0)[l-7r„(j,0)] 



j+i 



^iV„(0,Ovrn(0,0[l-vr„(0,0] 

(J+1) X (L + 1) 
A^„(0,0)7r„(0,0)[l-7r„(0,0)] 
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Furthermore, criterion C4 coincides with C5 and is given by 

V2 



(3.7) 



tr( — D^M-^D^ =trf— E^M-^E 
n J \ n 



n ) Af„(0,0)7r„(0,0)[l-7r„(0,0)]' 
Proof. See Appendix A.l. □ 

Remark 3.1. Contrary to CI and C2, from (3.6) and (3.7) it is easy to 
see that the trace criteria C3-C5 treat the covariate profiles in a different 
way due to the nature of the OLS estimators and the fact that these crite- 
ria correspond to the minimization of the mean variance of the estimators 
of the parameters of interest without taking into account their covariance 
structure. 

Note that C1-C5 depend on the design only through the allocation vec- 
tor tt; it is also straightforward to check that the above criteria are strictly 
convex in tt and will be minimized by (3.2) independently on the covariates. 
However, the loss of inferential precision expressed by C1-C5 is random 
since it depends on the number of subjects within the different strata N„, 
that is, ^>7 = $7(7r„,N„). Therefore, in order to remove the effect due to the 
random covariates, from now on we take into account the loss of inferential 
precision induced by the design 

(3.8) $7(7r„)=^z[^'/(7r„,N„)]. 

4. Optimal design for ethics. From an ethical viewpoint, a natural de- 
mand consists in an overall benefit for the entire sample of patients involved 
in the trial, for instance maximizing for any given sample size the percent- 
age of patients who receive the best treatment. This make sense if and only 
if the treatment effects are different; otherwise there is no longer a worse 
treatment, stressing that the comparative experiment degenerates to just 
one treatment. 

Assuming model (2.1), at each stratum {tj,wi) the superiority/inferiority 
of A or i? depends only on the sign of 9 in (2.2), and from now on we 
let for simplicity 0{j,l). Assuming "the-larger-the-better" scenario, for each 
subject with covariate profile {tj,wi) the allocation will be made to the supe- 
rior treatment if (^j]l{e(jy)>o} + (1 ~ ^i)^{e(j,i)<o}'i otherwise, if 9{j, I) = the 
two treatment arms collapse and all the allocations are equivalent (i.e., any 
ethical measurement is no longer useful). Thus, the percentage of patients 
assigned to the best treatment is 



j=o 1=0 



^ -7r„(i,/) 



sgn(0(i,/)) 
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where sgn(a;) represents the sign of x. However, from (2.2) the relative per- 
formance of the treatments depends on the subject's covariates, so that 
a reasonable ethical measure is 



$s(7r„,N„) 

(4.1) 

j=o 1=0 ^ 



2 -^n(j,0 



sgn(0(j,/)) 



under which every choice is weighed by the relative ethical gain \6(j,l)\. 
Obviously, criterion (4.1) depends on both the covariate profiles and the 
unknown parameters of the model and the optimal ethical target, namely 
the allocation that assigns all the patients to the better treatment, is 

(4.2) 7V% = {TT%{j,l) = l|9(j-i)>o} for any j = 0,. . ., J and l = 0,...,L}. 

Remark 4.1. In the absence of treatment /covariate interactions, 9{j, I) = 
a for any j = 0, . . . , J and / = 0, . . . , L, so that criterion (4.1) simply becomes 



that depends on the design only through the total proportion of assignments 
to A. Thus, under (2.4) the percentage of allocations to the best treatment 
does not depend on the covariates, which are irrelevant from an ethical 
viewpoint, so that the optimal ethical target is vr^(i, I) = 1{q>o} for Ui 0- 

Analogously to (3.8), in order to remove the random effect due to covari- 
ates from now on we adopt as ethical criterion $£;(7r„,) = £'z[^_B(7rn,, N„)] 
given by 

J L 



(4.3) ^EM=i2i2p(^^^^\^(^Ml 



j=0 1=0 



^-7r„(i,0 



sgn(0(j,/)) 



5. The compromise criterion. In order to obtain a suitable compromise 
between inferential precision and ethical demands, there are several pos- 
sible approaches. Among them, a trade-off the criteria via a combined or 
a constrained optimization has, to the best of our knowledge, the strongest 
theoretical justification; see, for example, [4, 9-11, 17, 22]. For the sake 
of generality, we now suggest a methodology based on the optimization of 
a compound criterion that mediates between information and ethics, since 
the constrained optimization approach can be regarded as a special case of 
this proposal (as it will be shown in Remark 6.1). 

Note that, for any chosen inferential criterion C1~C5, and are not 
homogeneous measures, and in order to put them in a comparable scale, we 
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consider their standardized version, that is, 

(5.1) ^.K) = |4^ and ^,M = bpl. 

where ttJ in (3.2) is the optimal inferential target minimizing and tt^ 
in (4.2) maximizes (4.3) by assigning all subjects to the best treatment, with 

^E{7V*E)=Ezmz)\]=EUl^tomiMjJ)- 

Clearly, ^e^^i ■[0,1]^'^^^'^'^^^^'^ — )• [0,1] represent standardized measures 
of ethical and inferential efficiency, respectively, that will be maximized. 
Therefore, by introducing an ethical weight uj, we let, as a compromise cri- 
terion, 

which can be seen as the reciprocal of the weighted harmonic mean of 
and see also [4, 11]. 

The ethical weight u in the compound criterion can be chosen by the 
experimenter, with < c^j < 1 (we assume a; 7^ 1 in order to avoid that the 
ethical impact completely overcomes the inferential goal). It may be fixed 
a priori or could be modeled as a function of the unknown parameters on 
the basis of the given real situation. In the latter case uj is allowed to depend 
on the true state of nature, since it is reasonable to suppose that the more 
the effects of the treatments differ, the more important for the patients are 
the chances of receiving the best treatment, whereas in the case of a small 
difference, which is more difficult to detect correctly, more emphasis is given 
on inferential precision. In particular, under (2.1) the ethical impact depends 
on the covariates, and thus, in order to express an overall measure of ethical 
risk for the population of interest, from now on we let a;(-Ez[|^(z)|]) U 
{0} — )• [0; 1) to be a a continuous and increasing function with a;(0) — )■ 0. 

In general, the choice of the weight function depends on the given applied 
context. For instance, in Phase III trials the experimenters have often some 
information gathered from previous stage trials, and more attention is usu- 
ally needed for inference, provided that the ethical costs are not prohibitive 
(such as deaths of patients). Thus, to can be chosen to be an S-shaped func- 
tion as 

(5.3) a;,(x) = (l + x-2)-2(^+i)[2-(l + x-2)-2] with s > 0. 

Additionally, since in several clinical situations it is reasonable to assume 
that the ethical concern is negligible in the case of small difference between 
the treatment performances, for example, up to a value ^ of the overall risk, 
and then increases rapidly, we may assume a;(x) = for x <<;, with u}{x) — ?• 
for X ^ <;~^ , and uj{x) — )• 1 for x — )■ 00. Whereas, an alternative choice for the 
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ethical weight is the cdf of a chi-square r.v. xfr) ' where u decreases as the 
degrees of freedom r increases. By fixing small degrees of freedom, the latter 
choice allows us to model the ethical impact in order to grow rapidly, even 
when the overall ethical risk is moderate. 

Theorem 5.1. For every inferential criterion C1-C5, the compound 
criterion ^(^(7r„) is a strictly convex function o/tt^, so there exists a unique 
target allocation minimizing (5.2) which is the solution of the system of 
equations 



(5.4) 



2 9$/(7r, 



CO 



l-LO 



Proof. The suggested compound criterion is a linear combination of the 
reciprocals of and "^e- Clearly, [^z]"^ is strictly convex; moreover ^e 
in (4.3) is linear, and thus concave, and it is also non-negative, so that ^e 
is strictly convex in 7r„. Therefore, criterion (5.2) leads to a unique target 
allocation satisfying V^'tj(7r„) =0, namely 

that leads to (5.4) after simple algebra. □ 

6. The optimal compound target. In this section we describe the proper- 
ties of the allocation tt* = {tt* (j, /) : j = 0, . . . , J; / = 0, . . . , L} that minimizes 
the compromise criterion ^'^ in (5.2), and we shall refer to it as "optimal 
compound target." In general, tt* depends on the experimental choice of 
the inferential criterion $/ and the ethical weight w, as well as on the true 
state of the nature, that is, the unknown parameters and the probability 
distribution p = {p{j, I) : j = 0, . . . , J;l = 0, . . . , L} of the covariates. 

Theorem 6.1. For every chosen inferential criterion C1-C5, at each 
stratum {tj,wi) the optimal compound target vr*(j,Z) satisfies the following 
properties: 

• '^t)U^^) ^ (0)1) 0, continuous function of a, t and p, that is, vr*(j, /) = 
7r*(j, /;a,T,p), and it is increasing in 9{j,l); 

• if (a',r') and {a",T") are parameter values with corresponding ethical 
gains 0'{j,l) = —0"{j,l), then for any given covariate distribution p, 

(6.1) 7r*(i,/;a',r',p) = 1 - vr* (j, a", r", p), 

so that the optimal compound target always assigns more than half the 
subjects to the better treatment; 
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• if 9{j,l) > 0, then vr^(j, ^•^ increasing inp{j,l), whereas when 9{j,l) < 0, 
then vr*(j, Z) is decreasing inp{j,l). 

Proof. For any given stratum {tj,wi), let A:(7r„(j,/)) = 3^>j-(7r„)/ 
d'^nUJ)- From the convexity of the inferential criterion, the function fc(-) 
is monotonically increasing with k{l/2) =0, due to the optimality of the 
jointly balanced design in (3.2). Furthermore, the right-hand side of (5.4) 
is a continuous function of a, r and p, due to the properties of it 
is straightforward to check that, for any given p{j,l), it is also an increas- 
ing function of 0{j,l) and is monotone in p{j,l) (increasing when 9{j,l) > 
and decreasing if 0{j,l) < 0). Since the sign of the left-hand side of (5.4) 
depends only on the sign of k{-), if 6{j,l) > (i.e., A is better than B 
for this stratum), then the right-hand side of (5.4) is positive and thus 
the optimal compound target vr*(j, /) > 1/2; otherwise, if 0{j,l) < 0, then 
7r*(j, Z) < 1/2 and 9{j,l) = if and only if '/r*(j, I) = 1/2. Moreover, observe 
that lim^_^o^(C) = ~oo and lim^_^i A;((^) = -|-oo and thus 7r*(j, /) / {0,1}, 
since $_e(-) is limited. By taking the derivative of the left-hand side of (5.4) 
with respect to 7r„(j, /) we obtain 

^^^^i^i^EM? + 2^E{nn)HMj, l)p{j, I), 

where the first term is always positive, due to the convexity of the inferential 
criterion. Furthermore, locally around '/r*(j, /) 

M<(i,0)^(j,Op(j,0>o, 

since if 0{j,l) > 0, then 7r*(j, /) > 1/2 and k{7r^{j,l)) > (and, analogously, 
when 6{j,l) < 0). Thus, as a function of TTn{j,l) the left-hand side of (5.4) 
is locally increasing around 7r*(j, /), so that vr*(j, Z) is a continuous function 
of a, T and p and it is increasing in 0(j, /) and p(j, /), due to the property of 
the right-hand side of (5.4). Concerning (6.1), for a given covariate distribu- 
tion p the ethical criterion in (4.3) can be regarded as a function of vr„(j, /) 
and 9{j,l) by letting 



i=0 1=0 ^ 



2 -^n(i'0 



sgn(0(i,/)) =t; + 5(7r,(j,0,^(j,0), 



where 

5(vr„(j, 0,^(i,0) = l^(j,Olp(j,0 



^ -7r„(j,0 



sgn(0(i,/)) 



so the left-hand side of (5.4) can be rewritten as ["L" -|- (?(7r„(j, /), ^(j, /))] • 
k{'Kn{j-,l))- First of all note that, for every chosen inferential criterion Cl- 
C5 the function k{-) is symmetric around the point (1/2; 0) since ^(1/2 -|- 
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e) = -k{l/2-e) for any eG (0;l/2). Moreover, g{l/2 + e,e{j,l)) = g{l/2 - 
e,—9{j,l)), so that the left-hand side of (5.4) is also symmetric around 
(1/2; 0), and this implies the symmetric property of the compound target. 

□ 

6.1. Example: the inferential criteria based on the determinant. From (3.4) 
and (3.5) it is easy to see that CI and C2 have the same standardized version, 

J L 

(6.2) */(7r„) =4(^+i)(^+^) nn^"(j''^)[l -^"(•?■'^)^ 

i=o^=o 

Assuming now two binary covariates and two different scenarios for their 
population distribution, that is, a uniform one U where each stratum is 
equally represented, that is, p(0,0) =p(l,0) =p(0,l) =p(l,l) = 0.25, and 
a nonuniform distribution J\fU with p(0, 0) = 0.2, p{l, 0) = 0.3, p{0, 1) = 0.4, 
p(l, 1) = 0.1. Table 1 shows the derived optimal compound targets in the 
case of four different ethical weights, namely the cdf 's of a x^r) ^^^h r = 1, 2 
and LOg in (5.3) with s = 1, 2. 

The optimal compound target always assigns the majority of subjects 
to the better treatment. The ethical weight increases as r and s decrease, 
and therefore less emphasis is given to the inferential precision and more 
attention to ethical demands, as expected. Furthermore, since criteria CI 
and C2 treat every stratum in the same way, when p{j,l) =pij,l) and 
9{jJ) = 1 - 6(jJ), then 7r*Jj,l) = 1 - KQJ)- 

6.2. Example: the inferential criteria based on the trace. Consider two 
binary covariates under the same settings of the previous example. By taking 
the approximation Ez[Nn{jJ)~^] = [np{jj)]~^, Table 2 shows the optimal 
compound targets when the inferential criterion C3 is adopted, whereas 
Table 3 deals with criterion C4 (or, equivalently, C5). 

Note that, since C3-C5 treat the strata in a different way, even if p{0, 1) = 
p{l, 1) = 0.25 and 61(0, 1) = -0{l, 1) = - 1, that is, when (a, r*) = (-4, -1,3,3), 
then 7r:(0,l)/l-7r* (1,1). 

In general, when r = s the ethical skew is larger if we adopt the cdf 
of xfr) w.r.t. ujg, and this skew is particularly high for r = 1, which could 
induce strong imbalances among the treatment groups (see Tables 1-3 with 
(a,T*) = (—4,-1,3,3)) and, consequently, a great loss of inferential effi- 
ciency. This behavior suggests that the adoption of the cdf of xfi^ is adequate 
only in situations with prohibitive ethical costs. 

Remark 6.1. Using the analytical expressions of the inferential crite- 
ria given in Proposition 3.1, by (5.1) it is also possible to derive optimal 
targets via a constrained optimization approach. In the same spirit of [22], 



Table 1 

Optimal compound targets adopting criteria C1-C2 









(a,T') = 


(1,1,1,1) 






(«,t') = 


(- 


■4,-1,3,3) 








0(0,0) = 1 


0(1,0) =2 


0(0,1) = 2 


0(1,1) =4 


0(0,0) = -4 


0(1,0) = - 


■5 


0(0,1) = -1 


0(1,1) = 1 






<(o,o) 


<(i,o) 


<(o,i) 


<(1,1) 


<(o,o) 


<(i,o) 




<(o,i) 


<(1,1) 


X(i) 


NU 


0.578 


0.700 


0.743 


0.646 


0.278 


0.186 




0.371 


0.534 


u 


0.593 


0.670 


0.670 


0.771 


0.242 


0.209 




0.415 


0.585 


X(2) 


NU 


0.544 


0.623 


0.660 


0.587 


0.352 


0.264 




0.421 


0.520 


U 


0.554 


0.605 


0.605 


0.689 


0.319 


0.287 




0.449 


0.551 




NU 


0.537 


0.606 


0.637 


0.572 


0.353 


0.265 




0.421 


0.520 




U 


0.549 


0.596 


0.596 


0.674 


0.321 


0.289 




0.449 


0.551 




NU 


0.521 


0.562 


0.581 


0.541 


0.397 


0.324 




0.447 


0.513 




U 


0.530 


0.559 


0.559 


0.614 


0.373 


0.346 




0.466 


0.534 












Table 2 


















Optimal compound targets 


adopting criterion C3 














(a,T^) = 


(1,1,1,1) 






(«,T^) = 


(- 


■4,-1,3,3) 








0(0,0) = 1 


0(1,0) =2 


0(0,1) =2 


0(1,1) =4 


0(0,0) = -4 


0(1,0) = - 


■5 


0(0,1) = -1 


0(1,1) = 1 






<(o,o) 


7r'(l,0) 


7r'(0,l) 


<(1,1) 


7r'(0,0) 


7r'(l,0) 




<(0,1) 


7r:(i,i) 


X(i) 


NU 


0.658 


0.868 


0.900 


0.805 


0.179 


0.077 




0.128 


0.677 


U 


0.697 


0.835 


0.835 


0.916 


0.154 


0.099 




0.214 


0.846 


X(2) 


NU 


0.572 


0.792 


0.841 


0.706 


0.277 


0.125 




0.205 


0.582 


U 


0.598 


0.745 


0.745 


0.866 


0.241 


0.158 




0.318 


0.759 


Wi 


NU 


0.557 


0.767 


0.821 


0.678 


0.279 


0.126 




0.206 


0.581 




U 


0.586 


0.728 


0.728 


0.856 


0.243 


0.159 




0.320 


0.757 




NU 


0.530 


0.696 


0.760 


0.610 


0.346 


0.169 




0.268 


0.546 




U 


0.548 


0.658 


0.658 


0.806 


0.308 


0.210 




0.382 


0.692 



H 

?3 
H 

O 

o 

H 

a 
a 
o 
d 

> 

< 

S 

> 

H 

a 

o 
o 

a 

H 
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Table 3 

Optimal compound targets adopting criteria C4 or C5 
(a,T') = (l, 1,1,1) (a,T') = (-4,-1,3,3) 

0(o,o) = i 0{i,o) = 2 e{o,i) = 2 e{i,i)=4 e{o,o) = -4 0(i,o) = -5 0(o,i) = -i 0(i,i) = i 







7r:(0,0) 


7r*(l,0) 


7r'(0,l) 


7r:(l,l) 


7r'(0,0) 


7r'(l,0) 


7r'(0,l) 


7r:(i,i) 




NU 


0.677 


0.860 


0.895 


0.795 


0.166 


0.082 


0.137 


0.663 


U 


0.717 


0.827 


0.827 


0.912 


0.142 


0.105 


0.225 


0.837 


X(2) 


NU 


0.585 


0.782 


0.833 


0.694 


0.259 


0.133 


0.217 


0.573 


U 


0.615 


0.734 


0.734 


0.859 


0.223 


0.167 


0.331 


0.747 


LJi 


NU 


0.567 


0.756 


0.812 


0.666 


0.261 


0.134 


0.218 


0.572 




U 


0.601 


0.717 


0.717 


0.849 


0.225 


0.169 


0.333 


0.744 




NU 


0.536 


0.685 


0.749 


0.601 


0.328 


0.179 


0.282 


0.541 




U 


0.558 


0.645 


0.645 


0.797 


0.289 


0.221 


0.393 


0.679 
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Table 4 

Constrained optimal targets adopting criteria C1-C2 and Bandyopadhyay and Biswas's 
allocations tit, with (q,t*) = (1, 1, 1, 1) and uniform distribution 







6»(0,0) = 1 


0(1,0) = 2 


0(0,1) = 2 


0(1,1) =4 




*j = C 


















IjJc 


= 0.356 


0.523 


0.546 


0.546 


0.589 


0.56 


0.95 


bJc 


= 0.483 


0.528 


0.566 


0.566 


0.612 


0.59 


0.9 


LOC 


= 0.700 


0.558 


0.612 


0.612 


0.698 


0.64 


0.75 


bJc 


= 0.883 


0.599 


0.679 


0.679 


0.781 


0.72 


0.5 


UJC 


= 0.969 


0.656 


0.756 


0.756 


0.851 


0.79 


0.25 






TTT (0,0) 


7rT(l,0) 


7rT(0,l) 


^t(1,1) 




10"® 


r = 


-- 1 


0.841 


0.977 


0.977 


0.999 


0.97 


r = 


--2 


0.691 


0.841 


0.841 


0.977 


0.88 


0.02 


r = 


-i 


0.631 


0.748 


0.748 


0.909 


0.81 


0.17 



the problem lies in finding the allocation that maximizes the ethical impact 
for a chosen inferential efficiency. In our context this corresponds to min- 
imize under the constraint < for a prefixed constant C < 1 
(we exclude the degenerate case C = 1 that corresponds to assume = 1 , 
i.e., no ethical concerns), representing a special case of our combined opti- 
mization approach; see also [4, 9, 10]. Indeed, due to the strict convexity 
of both and "^J , this is a convex optimization problem, and therefore 
the Karush~Kuhn~Tucker (KKT) first order conditions are necessary and 
sufficient and guarantee a unique optimal solution vr* solving 

(6.3) V{^^H7r*)) + KV{^j\7r*)) = 0, 
where k > is the KKT multiplier and 

(6.4) k{^j\7v*)-C-^)=0. 

First, note that (6.3) corresponds to (5.4) with k= (1 — uj)/uj (where clearly 
oj 7^ 0, since C ^ 1); thus, the candidate solution vr* belongs to the class of 
compound optimal targets and should satisfy ^J^{7v*) = C~^, since k > 0. 
By using the same arguments of the proof of Theorem 6.1, any compound 
target vr* solving (5.4) is a continuous and monotone function of u and, from 
the properties of ^7^' ^^^d C < 1 there exists a unique constant 

weight uj = ioc such that tt* = tt*^ satisfies (6.4). 

As a numerical example, consider now the standardized inferential cri- 
terion (6.2) in the case of two binary covariates with (a,T*) = (1,1,1,1) 
and uniform distribution Li. The upper block of Table 4 shows the derived 
constrained optimal targets as C varies, together with the corresponding 
ethical weight ujc of our combined optimization approach and the value of 
the ethical criterion ^e- Moreover, the bottom part of the Table gives the 
asymptotic allocations of the procedure suggested by Bandyopadhyay and 
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Biswas [7] (and further analyzed in [3]), which are given at each stratum 
(j, /) by the standard normal cdf evaluated at 6(j,l)/T, for T= 1,2 and 3. 
For instance, an inferential efficiency equal to 75% under the constrained 
approach corresponds to an ethical weight equal to 70% in the combined 
framework; whereas, assuming C = 0.25 means that the role of ethics is al- 
most dominant in the combined optimization. Clearly, the arbitrary choice 
of the constant C in the constrained setting can be directly translated in the 
subjective choice of to in the combined approach; however, the possibility of 
modeling the ethical weight as a function of the unknown parameters allows 
us to discriminate among different situations that could be a-priori only 
partially known or, more commonly, completely unknown. For instance, if 
we set uc = 0.483, then the 38.8% of subjects within the stratum (1, 1) will 
receive the worst treatment; whereas, the percentage of allocations to the 
worst treatment is only 22.9% or 32,6% if the cdf of X(i) or wi are assumed 
as weight functions, respectively; see Table 1. Moreover, the allocations pro- 
posed by Bandyopadhyay and Biswas (2001) are strongly skewed toward the 
better treatment, so that the inferential precision collapses. This is particu- 
larly true for small values of T, as emphasized in [18], and this behavior is 
also confirmed, even if we adopt criteria C3-C5. 

7. The reinforced doubly adaptive biased coin design. As shown previ- 
ously, the compound target allocation tt* depends in general on the unknown 
parameters of the model and, since this function is continuous, covariate- 
adjusted response-adaptive procedures may be called for. These designs use 
at each step the observed responses, the covariates and the previous assign- 
ments, as well as the covariate profile of the current subject, to modify the 
allocations as the experiment goes along in order to gradually approach the 
desired target. 

In this section we introduce the reinforced doubly adaptive biased coin 
design. This is a general class of CARA procedures, which admits both 
continuous and discontinuous randomization functions, aimed at targeting 
any chosen allocation proportion by forcing closeness to the target when 
necessary. 

Let now tt* = {vr* : j = 0, . . . , J;l = 0, . . . , L} be a desired allocation 
such that, at each stratum {tj,wi), ■K*{j,l) E (0,1) is a continuous function 
of the unknown model parameters -f and p. Suppose that patients come to 
the trial sequentially and are assigned to either treatment. Starting with m 
observations on each treatment, usually assigned by using restricted random- 
ization, an initial non-trivial parameter estimation 'J2m P2m is derived. 
Then, at each step n {n> 2m) let -y^ and p„ be the estimators of the pa- 
rameters based on the first n observations, where we assume them consistent 
in case of i.i.d. observations, so that the optimal target will be estimated by 
all the data up to that step by tt* = 7r*(7„, p„). When the (n -|- l)st patient 
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with covariate profile = {tj,wi) is ready to be randomized, the rein- 

forced doubly adaptive biased coin design assigns to him/her treatment A 
with probability 



where the function (p{x, y, z) : (0, 1)'^ — t- [0, 1] satisfies the following properties: 

(i) ip is decreasing in x and increasing in y, for any z G (0, 1); 

(ii) X, z) = X for any z G (0, 1); 

(iii) If is decreasing in z if x < y, and increasing in z if x > y; 

(iv) ip{x, y,z) = 1 — ip{l — x,l — y,z) for any z G (0, 1). 

Adopting the RDBCD in (7.1), within each stratum the allocation propor- 
tion will be forced to the corresponding target, since from conditions (i)-(ii) 
when x>y, then ip{x,y,z) < y and if x < y, then ip{x,y,z) > y for any 
z G (0,1). Furthermore, condition (iv) simply guarantees that A and B are 
treated symmetrically, whereas (iii) means that the allocation is forced to- 
wards optimality increasingly as the representativeness of the strata in the 
population decreases. This property is of great importance since the con- 
vergence of the allocation proportion depends on the number of subjects 
belonging to each stratum, and therefore it is related to the population dis- 
tribution of the covariates. This may be particularly critical for small sam- 
ples, where some profiles could be strongly under-represented so that, both 
from the ethical and inferential viewpoint, the need to force the closeness to 
the target could be greater. 

Remark 7.1. In general, at each step the allocation probability (7.1) 
does not depend only on the estimates of the unknown parameters related 
to the stratum where the current subject belongs; in fact, 7f*(j, /) could in- 
volve the estimation of the entire set of parameters 7 and p. For example, 
if we adopt the optimal compound target tt*, at each stratum 7r*(j, /) de- 
pends on a, T and p [see, for instance, (5.4)], so that (7.1) depends on the 
information gathered up to that step from all the strata. 

Observe that we do not assume the continuity of 99, and therefore it is 
possible to consider discontinuous randomization functions. For instance, 
a natural extension of the ERADE proposed by [14] in the presence of co- 
variates is 



(7.1) 




where the constant p G [0, 1) controls the degree of randomness. 



Remark 7.2. The randomization function in (7.1) could also be chosen 
in a different way for each stratum {tj,wi) by letting ip = ipji, in order to 
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discriminate the importance of each of them and the corresponding closeness 
to the target. Furthermore, the RDBCD can be naturally extended to the 
case of several treatments. 

An interesting family of allocation functions belonging to the RDBCD is 

if{x-y-z) = F[D{x-y)''^-^^F-\y)] 

(7.3) /{F[D{x-yf^^)F-\y)] 

+ F[D(l-x;l-y)^(^)F-i(l-y)]), 

where FiM"*" — )• M"*" is continuous and strictly increasing, H[z) is decreas- 
ing, while D(x;y) : (0; 1)^ — t- M+ represents a dissimilarity measure between 
the actual allocation proportion x and the current estimate of the optimal 
target y, and D is assumed to be decreasing in x and increasing in y, with 
D{x;x) = 1. 

Example 7.1. Letting D{x;y) = 1 for any {x,y) G (0,1)^, then (7.3) 
corresponds to the CARA design analyzed by Zhang et al. [23], namely 

(7.4) ipz{x;y;z)=y V(x, z) G (0, 1)^, 

which represents an analog of the sequential maximum likelihood design in 
the presence of covariates. Whereas if we let F(t) = t, D{x;y) = y /x and 
H(z) = > G (0, 1), we obtain a natural extension in the presence of 
covariates of the family of doubly adaptive biased coin designs, that is, 

yjy/x)" fi f n n 

y{y/xy + {l-y)[{l-y)/{l-x)Y ^ ' 

However, note that the previous allocation function does not correspond 
to the covariate- adjusted doubly adaptive biased coin design suggested by 
Zhang and Hu in [24], due to the fact that these authors assume at each 
step n a dissimilarity measure between the actual allocation proportion 
T^nUJ) and the mean (over the steps) of the estimates of the optimal target 
X]r=i ^2*0'' instead of the current estimate of the target vf* (j, /) itself. 

Example 7.2. If we set F{t) = with /o 0, D{x] y) = l — [x — y) and 
H{z) = z~^ Mz G (0, 1), then (7.3) becomes 

(^.\ ( \ y[l-{x-y)fl^ 

(7.5) <fBAzA^'^y'^z)- 



y[l - (x - y)fl' + (1 - y)[l - {y - x)]>'/' ' 

In order to account for discontinuous allocation functions, let for instance 
F{t) = t, H{z) = {{J + 1)(L + l)z}-i and 

j-l + e, x<y, 
D{x;y) = S 1> x = y, 

[l-£, x>y, 
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with £ G [0, 1). Then (7.3) becomes 

^BAZ2{x\y;z) 

(7.6) 



y[l + £){{J+1)(L + 1)4-' + (1 _ _ £){(J+1)(L + 1)4-1 ' 

X < y, 

y(l _e){{^+i)(i+i)4~' 

y(l _ £){{J+1)(L + 1)2}-1 + (1 _ + £){(J+1)(L + 1)4-1 ' 

X > y, 

which allows us to force the allocations toward the chosen target increasingly 
the more we move away from the uniform distribution, maintaining at the 
same time a good degree of randomness. 

Remark 7.3. If we assume only an inferential viewpoint by letting 
w = 0, then the optimal target is the jointly balanced allocation in (3.2), 
so that the allocation probability (7.1) corresponds to a stratified random- 
ization. For instance, letting F{t) = t^, H{z) = 1 and D{x;y = 1/2) = 1 — 
2{x — 1/2), then procedure (7.3) corresponds to the -Dyi-optimal design pro- 
posed by Atkinson [1], 

[l-vr„(j,0]'+vrn(j,/)2' 
see the supplementary data in [5] for details. 

Let TT*^.^^ = (7r*(j,l)p(j,l),...,7r*(j,L)p(j,L))* and ^f** = (7r**,7r^, 

^T= (Y,7r*{lJ)p{l,l),...,Y,^*iJ^l)PiJ^l)] ' 
V/=0 i!=0 / 

Vjr=0 3=0 ) 

and TT^^yy given by 

(7r*(l,l)p(l,l),...,7r*(l,L)p(l,L),..., 

7r*(J,l)p(J,l),...,7r*(J,L)p(J,L))*. 

The following theorem establishes the strong consistency of both the al- 
location proportion and the estimator of the target, as well as the strong 
consistency and asymptotic normality of the estimators of the unknown pa- 
rameters of the model. 
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Theorem 7.1. For any given target allocation tt* G (0, which 
is a continuous function of the unknown model parameters 7 and p, then 
adopting the reinforced doubly adaptive biased coin design (7.1), as n tends 
to infinity 

(7.7) 7Vn — )■ TT* a.s. and tt* — >• tt* a.s. 

Furthermore, 

(7.8) 7.„— >7 a.s. and ■v/n(7^ — 7) — )• A^(0; A4^"'^(7r*)) inlaw, 
where Ai(7T*) is 



\ 

with 

(7.9) 



J L 













^;** 

TT 


Olxp 


j=0 1=0 


J L 






1 


i=o z=o 


Olxp 


Ip-TT 


— * 

TV 

Opxl 


Opxl 
-1 ~* 
Ip - TT 


Opxp 


Opxp 









Mi, 



such that AI22 = diag(7r^^,yy 
/ 
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_*< 




diag(7r|.) 
















* 


diag(7r^) 
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TT 








*t 



\diag(7r*^^^) diag(7r*^^^) ... diag(7r*^^^) / 

Moreover, M.^ is partitioned similarly to (7.9) with treatment A replaced 
by B, that is, M.22 = diag(l jl — tt^ 



( 



diag(l,7 - tt:; 



\ 



diag(li -TT^)/ 
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\diag(li - 7r*^^w) diag(lL - 7r*^^w) 
Proof. See Appendix A. 2. □ 
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diag(l2 



Note that the asymptotic normality of the allocation proportions can be 
derived as in [13, 14, 23] by adding suitable conditions of differentiability for 
the target tt* and, eventually, for ip. 

Corollary 7.1. Let the optimal compound target tt* be the desired 
allocation, then adopting the RDBCD in (7.1), as n tends to infinity 

7r„ — 7- TT* a.s. and tt * — t- tt* a.s. 

7„ — )■ 7 a.s. and -v/n(7„ — 7) — )■ A^(0; A4~"'^(7r* )) inlaw. 

8. Finite sample properties. In order to perform some finite sample com- 
parisons, we have conducted a simulation study by adopting the inferential 
criterion CI and assuming as ethical weight function uj the cdf of xfiy More- 
over, we have taken into account normal responses with = 1 and two bi- 
nary covariates with the previously used settings, that is, (i) two population 
scenarios, namely the uniform distribution U and the nonuniform one MV( 
with p{0, 0) = 0.2, p{l, 0) = 0.3, p{0, 1) = 0.4, p{l, 1) = 0.1, (ii) two parame- 
ter settings: (a, r*) = (1, 1, 1, 1) and (a, r*) = (—4, —1,3, 3). The results come 
from h = 500 simulations with m = 4 and n = 500. Concerning the continu- 
ous randomization functions, we consider the CARA design (pz in (7.4) and 
the RDBCD in (7.5) with k = l. As regards the discontinuous case, we per- 
form the simulations adopting ipBAZ2 in (7-6) and v^erade in (7.2), where 
we put £ = p = 2/3, for homogeneity. Expectation and standard deviation 
(within brackets) of the proportion of allocations to treatment A are given 
in Tables 5-8. 

In general, ipz in (7.4) is characterized by the strongest variability with 
respect to the other procedures, since it is based only on the current estimate 
of the target, independently on the actual allocation proportion. Moreover, 
as theoretically shown in [23], the variability of this design increases as the 
representativeness of the strata decreases, and this behavior is also confirmed 
by 97erade; see Tables 7 and 8. On the other hand, fBAZi and (PBAZ2 
tend to balance the variability of the allocation proportions between the 
different population strata. Indeed, the standard deviations of the design 
in the different patterns are similar, since the reinforced doubly adaptive 
biased coin design forces the closeness to the desired target the more the 
are under-represented. 
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Table 5 

Expectation and standard deviation (within brackets) of the proportion of allocations to A 
under the uniform distribution lA with (a,T*) = (1, 1, 1, 1), so that the optimal compound 



target 


is TT* (0,0) =0.593, TT* 


(1,0) = ^(0,1) 


= 0.670 and 7r*(l,l) 


= 0.771 




7r„(0,0) 


7r„(l,0) 


7r„(0,l) 


7r„(l,l) 




0.592 


0.667 


0.666 


0.764 




(0.051) 


(0.049) 


(0.045) 


(0.041) 


tpBAZi 


0.592 


0.667 


0.670 


0.768 




(0.027) 


(0.027) 


(0.026) 


(0.025) 


fBAZ2 


0.591 


0.668 


0.669 


0.769 




(0.017) 


(0.016) 


(0.016) 


(0.014) 


</3eRADE 


0.589 


0.665 


0.666 


0.764 




(0.019) 


(0.019) 


(0.019) 


(0.018) 



Note that, in general, discontinuous randomization functions perform bet- 
ter with respect to the continuous ones. This is quite natural since, when 
the allocation proportion is around the target — in particular, for sufficiently 
large samples, due to the almost sure convergence — the continuous allocation 
procedures randomize the assignment with probability close to the target, 
while the discontinuous ones tend to force the allocation in the same way at 
each step, even asymptotically. 

Moreover, our simulation study points to the fact that (pBAZ2 guaran- 
tees always a stable behavior and tends to have better performances w.r.t. 
VeradE; especially for strongly under-represented strata. For instance, in 
the case of a nonuniform covariate distribution, when {tj,wi) = (1,1) the 
standard deviation of the allocation proportions under '^baz2 is almost half 
of the (/Jerade's one (see Tables 7 and 8). 

Table 6 

Expectation and standard deviation (within brackets) of the proportion of allocations to A 
under the uniform distribution U with (a,T*) = (—4,-1,3,3), so that the optimal 





compound target is 7r*(0,0) 


= 0.242, 7r:(l,0) = 


0.209, 7r:(0,l) 


= 0.415 and 






7r:(l,l) =0.585 








7r„(0,0) 


7r„(l,0) 


7r„(0,l) 


7r„(l,l) 


^z 


0.250 


0.217 


0.416 


0.582 




(0.042) 


(0.041) 


(0.049) 


(0.050) 


fBAZi 


0.244 


0.211 


0.412 


0.585 




(0.024) 


(0.022) 


(0.024) 


(0.026) 


fBAZi 


0.244 


0.212 


0.415 


0.585 




(0.013) 


(0.013) 


(0.017) 


(0.016) 


V^ERADE 0.251 


0.217 


0.417 


0.584 




(0.017) 


(0.016) 


(0.018) 


(0.019) 
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Table 7 

Expectation and standard deviation (withm brackets) of the proportion of allocations to A 
under the nonuniform distribution MU with (a,T*) — (1, 1, 1, 1), so that the optimal 



compou 


nd target is tt^^ 


;(0,0) =0.578, 7ri(l,0) = 


0.700, 7r:(0,l) 


= 0.743 and 






7r:(l,l)= 0.646 








7r„(0,0) 


7r„(l,0) 


7r„(0,l) 


7r„(l,l) 




0.576 


0.696 


0.732 


0.651 




(0.054) 


(0.041) 


(0.034) 


(0.071) 


<PBAZi 


0.577 


0.699 


0.739 


0.646 




(0.026) 


(0.025) 


(0.024) 


(0.028) 


fBAZ2 


0.577 


0.698 


0.740 


0.646 




(0.017) 


(0.015) 


(0.014) 


(0.017) 


V^ERADE 


0.576 


0.694 


0.738 


0.640 




(0.021) 


(0.018) 


(0.014) 


(0.030) 



The same conclusions have been observed through further simulations, 
omitted here for brevity, with larger sample size. However, in this paper 
we decided to present the case of n = 500 in order to emphasize the evo- 
lution of our procedure, with respect to the others, especially for strongly 
under-represented strata (note that the expected number of patients within 
stratum (1,1) under the nonuniform distribution MU is 50). 

9. Discussion. In the context of clinical trials for treatment comparisons, 
several different approaches have been proposed in the literature in order to 
provide a valid trade-off between ethical concerns and inferential precision 
(such as group sequential designs, interim analysis, etc.) and over the past 

Table 8 

Expectation and standard deviation (withm brackets) of the proportion of allocations to A 
under the nonuniform distribution ATU with (a,T') = (—4,-1,3,3), so that the optimal 



compou 


nd target is 7r*(0,0) 


= 0.279, ^:(1,0) = 


0.186, 7r:(0,l) 


= 0.371 and 






TT* (1,1) =0.534 








7r„(0,0) 


7r„(l,0) 


7r„(0,l) 


7r„(l,l) 


^z 


0.284 


0.197 


0.377 


0.539 




(0.050) 


(0.041) 


(0.035) 


(0.073) 


<PBAZi 


0.279 


0.188 


0.373 


0.535 




(0.026) 


(0.021) 


(0.026) 


(0.024) 


'PBAZ2 


0.280 


0.189 


0.373 


0.534 




(0.015) 


(0.015) 


(0.013) 


(0.013) 


V^ERADE 


0.286 


0.195 


0.375 


0.533 




(0.019) 


(0.018) 


(0.014) 


(0.023) 
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20 years there has been a growing stream of statistical papers on the topic 
of response-adaptive randomization; see, for instance, the seminal books of 
[12, 16]. Within this framework, in this paper we suggest a design strategy 
that combines efficiency and ethical gain for responses following a linear ho- 
moscedastic model. By using a compound optimization approach we derive 
optimal target allocations for the treatments that can be implemented via 
the adoption of a new class of CARA randomization procedure. Through 
the proposed methodology, the optimal compound allocations move away 
from balance (i.e., the optimal inferential target) toward the better treat- 
ment adaptively, on the basis of the treatment effects. Since joint balance 
implies marginal one, the proposed design strategy is robust with respect to 
possible misspecification of the model in terms of presence or absence of in- 
teractions among prognostic factors, or between treatments and covariates. 
Moreover, the proposed methodology is quite robust and performs well, even 
in the case of approximate homoscedasticity of the outcomes (perhaps after 
suitable transformations), as also pointed out by [3, 18]. On the other hand, 
in the case of heteroscedastic responses and, more generally, for generalized 
linear models, balance does not imply efficiency; our approach could still 
be applied, but with different inferential criteria (that could be optimized 
numerically) and different weight functions. Further research is needed on 
this topic. 



APPENDIX 



A.l. Proof of Proposition 3.1. 

CI. For the sake of simplicity in this Appendix, we will often omit the 
subscripts indicating the dimensions of vectors and matrices. Let M = nM, 
then 



det M = det 



F*d F*AF 



Olxp 



\Opxi 0^ 



pxp 





Opxl 



Olxp 
Opxp 



n ■ 



5^5. {1-dYF 



i=l 



F\l-S) F*(I-A)F/ 



Let Qa = F*AF and Ob = F*(I - A)F, and we obtain that 
det M = det I X] 5i - S^Ffl'^F^S I 



xdetfifii \ n-^6i \ - {1 - dYFn]^^F\l - S) I. 



1=1 
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Let now d*F = (N^, N^, N^^^), where 



{Nih),...,N{tj)), N* 



{N{wi),...,NiwL)) with iV(ti) = E«=o^(i'0 and iV^) = E,=o 0, 



and Ni, 



{N{1,1),...,N{1,L),...,N{J,1),...,N{J,L)). Clearly, iV(t,-) 



and N(wi) are the number of subjects assigned to treatment A within cat- 
egory tj of T (j = 0, . . . , J) and wi oi W {I = 0, . . . , L), respectively. Also, 

let N*^.^^ = (iV(j, 1), . . . , iV(j, L)) and N*,^^, = (iV(l, /),..., iV( J, /)). Then, 
the matrix fi^ can be partitioned as follows: 



(A.l) 
where 





B 




C 





/ 




i\r* 


A = diag(NT,Nw) + 




OjxJ 








f ■ ■ ■ Nj_j0w 


OlxL 



B 







N 








\diag(Nt,j 





diag(Nt2®w) 








"NT* 

diag(Nt^ 



and C = diag(NT®w)) where OkxK is the X-dim zero matrix. Thus, 



(A.2) 
where T 









u 





^) + (-C-1B*) r-^(ij+L,-BC-i), 



A - BC^B* 



diag(Nir®„,o,Nt(j0w)- Note that CIa is nonsin- 
gular if^and only if^ and T are^ nonsingular and detfi/i = detC • detF = 
Iljt=iNij,l)Uj=iNU,0)ntiNiO,l). From (A.2) it follows that 



S^Fft-^^F^S = d*F 



where 















F*5 + 5*F 



C iB* 



)f-1(Ij+l,-bc-1)f*5, 



<5*F 















^ .7 = 1 1=1 



and 



S^F 



Ij+L 

C ^B* 



F-i (Ij+z., -BC-i)F*<5 = 5] NU, 0) + ^ iV(0, /), 



1=1 
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Since 



(A.3) 



and 








li • 


• 0\ 








II 


• I'i 



(A.4) 



r~^BC 



/ N{l,0)-^li ^ 

iV(2,0)-il* 







\diag 



'""^ diag 








to 



7 



Therefore, S'Fil^'F'd = ^f=i ^i^, + EU ^(•?' 0) + ^f=i ^{0, 0, 

and consequently X^^^i 5^ - S^F{flA)~'^F^S = N{0, 0). Hence, 

J L 

det M = n n ^)'^(-?' ^) [1 - ^(j' ^)] 

j=0 1=0 

and thus (3.4) follows directly. 

C2. Note that M^"*^ can be partitioned as follows: 









^22 



where is given by 



/ n 



\ 



71 6. 

i=i ) 

_ /7V(0,0)7r(0,0) 





iV(0,0)[l -7r(0,( 
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and 



M 



22 



B 



l-2p 



F*5 
F\l-S) 



M 



12 



Thus, det(D*M-iD) = detMgaS which is given by 



det fj;^^ det 



A^(0,0)7r(0,0) 



det det 



F\i - s)ii - syFn- 

A^(0,0)[l-7r(0,0)] 



In 



Applying Sylvester's determinant theorem, we obtain that 



detri^Met 



iV(0, 0)^(0,0) 



detn 



A 



1 + 



A^(0,0)7r(0,0) 



Analogously for treatment B, so that (3.5) follows easily after simple algebra. 
C3~C4. Clearly tr(n-^D*M~iD) =tTM22, so criterion C3 is given by 



(A.5) 



+ tr 



Af(0,0)[l 



Note that tr(n ^(T^E*M ^E) coincides with (A.5) and thus C4 is equal 
to C3. Since 



r 1 


-r iRC 1 


-c-iB*r-i 


c-i + c-iB*r~iBC-i 



we obtain tr(n^^) = tr(r-i) + tr(C-i) + tr(C"iB*r~iBC"i). From (A.3) 
and (A. 4), it follows that 



J L 



tr(C-^B*r-^BC-^) = J^J]] 



-+- 



L 



J 



tU^N{j,o) N{o,i)J ^iv(j,o) frtNio,i) 



and thus tr(r2^"^) is 

J L 



EE 



1 1 
+ ■ 



J L 



J L 



1 1 
■ + - 



ttt\N{o,i) N{j,o)J ^,frt\N{j,i)J ^tttrViV(o,0 N{j,0) 

Moreover, from (A.2), (A.3) and (A.4) we obtain d^FCl-^^ = (l*j_^^, -l*j.^) 
and thus fl'^^F^SS^Ffl'^^ has unitary diagonal elements, so that 
tr(ri^^F*55*FfJ^^) = J + L + J ■ L. Analogously for treatment B. 
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A. 2. Proof of Theorem 7.1. As regards the first statement (7.7), at each 
stratum {tj,wi) we will prove the convergence of both 7r„(j, I) and vf* (j, I) to 
the target TT*{j, I); for ease of notation we will often omit the subscript (j, I) 
assuming that we are fixing the stratum {tj,wi). Let 5„ = (t(Yi, . . . , y„, 5i, . . . , 
6m Zi, . . . , Zn) denote the cr-field representing the history of the trial, with 
5o the trivial cr-field, and 6„ = f(5^n, ■^n+i)- Moreover, let AMj = [6i — 
={tj,wi)}^ then {AMj;i > 1} is a sequence of bounded mar- 
tingale differences with |AMj| < 1 for any i > 1; thus the sequence {M„ = 
J27=i^Mi-(Sn} is a martingale with ^Li ^K^^O^I^fc-i] < ^'n- 

Let In = max{s : 2m + 1 < s < n,Ns — NgTTg < 0}, with max0 = 2m, and 
note that 

n n 

n 

<Ni^ + i + Mn-Mi^+i+ Y ¥'(^fc-i;^Li;n-i)i{z,.=(i,,«;0} 

n 

k=l k=l 

since for any i > /n + 1, '^('''"i; 7r*;pj) < vf* < VTj; whereas Ni^ < NiJUl^ and 
thus 

Since p{j,l) > for each stratum {tj,wi), then as n — oo, iV„ — oo a.s. 
and therefore N~^Mn — t- a.s.; see, for instance, [21]. Furthermore, as n— t- 
oo, p„ — 7- p a.s. and at least one of the the number of assignments to the 
treatments, namely Nn and {Nn — Nn), tends to infinity a.s. As showed 
in [14], in any case 7„ has finite limit so that, from the properties of vr*, 
there exists a f S (0, 1) such that 

(A. 6) vr* — )• t; a.s. 

and so 



Ylk=l'^{Zk=(tj,wi)} 



a.s. 
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As n — )■ oo, then — oo or sup„ /„ < oo; in either case, 

^* Ni„ T!k=i%-iHzt,={ti,wi)} 

Til ~i^T — — sr^n a.S. 

"^n Lfc=ll{Zfc = fe,i«,)} 

and therefore 

(A. 7) (7r„ - 7?;^)+ ^ a.S. 

Let A„ = max{s ■.2m + I < s < n,{Ns - Ng) - Ns{l - vf*) < 0}, for any i > 
Xn + 1, we have ip{7ri;TT*;pi) = 1 - 99(1 - vrj; 1 - Tf*;pi) > vf* > Tr,. Then, 

n 

Nn-Nn = Nx„+l-Nx^+l+ Yl m^-^k)\&k-l)l{Z, = (t,,n.O} 
n 

+ E [(l-<5fc)-i^^((l-4)|©fe~i)]l{z,=(t,,«,,)} 

k=Xn-\-'2 

<Nx„ + l-Nx,.-{Mr,-My^+,) 

n 

k=Xn-\~2 

n 

<iVA„ + l-iVA„-(M„-MA„+i)+ {l-r^_^)l{z,=it„wO} 

k=Xn-\-2 
n 

= Nx^ + l- Nx^ - (Mn - Mx^+l) + ^(1 - 7f^_i)l|z, = (J,,»,)} 

k=l 

X-a + l 

k=l 

Hence, 

iNn-Nn)-Nnil-K) 



I fc=l 



'k = {tj,Wl)} 



+ 1 - (M„ - Ma„+i) - iV„(l - - ^(1 - r^_^)l{z,=it,,^,)} , 



fc=l 



so that 

(A.S) ((1 - 7r„) - (1 ^ a.S. 
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From (A. 7) and (A. 8), as n tends to infinity 



a.s. 



and by (A. 6) 



lim 7r.„ = lim vr* = v a.s. 



Since < f < 1, then < 1 — v < 1 and thus 



hm Nn — ?• oo a.s. and hm (N^ — Nn) — )• oo a.s. 

Therefore, hm„_j.oo 7n ~^ 7 ^-s- and from the continuity of the target 
hm„_j.oo = TT* = V a.s., that is, 

(A. 9) Hm TTn = vr* a.s. 



Taking into account the average information matrix M in (2.3), from (A. 9) 
and the proof of Proposition 3.1 it foUows that 



J L 



hm n ^ = V V7r*(j,/)p(j,/) a.s. 

n— J-oo ^ — ' ^ — ' ^ — ' 
j=0 1=0 



i=l 



hm n~^5*F = tt** a.s. and hm n"^F*AF = M' 



a.s. 



Thus, as n goes to infinity the information matrix converges almost surely 
to A4{7v*), which is nonsingular since X]/=o OpOj ^ (0, 1), Ai^ 

and Ad^ are nonsingular, and the matrix 



/ J L 

j;j;7r*(j,0p(j,0 

j=0 1=0 





S^F 

{1-syF 





J L 
j=0 1=0 



M 



B 



F^S 
F\l-S) 



7r*(0,0)p(0,0) 

1 -7r*(0,0)p(0,0) 

is nonsingular too, since 7r*(0, 0)p(0, 0) € (0,1). Thus, the asymptotic nor- 
mahty of 7„ follows directly. 



Acknowledgments. The authors thank the referees and the Associate Ed- 
itor for their helpful comments, which led to a substantial improved version 
of the paper. 



THE REINFORCED DOUBLY-ADAPTIVE BIASED COIN DESIGN 



31 



SUPPLEMENTARY MATERIAL 

Supplement to "Multi-objective optimal designs in comparative clinical 
trials with covariates: the reinforced doubly adaptive biased coin design" 

(DOT: 10. 1214/12- AOS1007SUPP; .pdf). An online supplementary file con- 
tains the extension of inferential criteria C1-C5 to the case of several co- 
variates. 
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